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Abstract 

A general class of unidirectional transforms is presented that can be computed in a distributed 
manner along an arbitrary routing tree. Additionally, we provide a set of conditions under which these 
transforms are invertible. These transforms can be computed as data is routed towards the collection 
(or sink) node in the tree and exploit data correlation between nodes in the tree. Moreover, when used 
in wireless sensor networks, these transforms can also leverage data received at nodes via broadcast 
wireless communications. Various constructions of unidirectional transforms are also provided for use 
in data gathering in wireless sensor networks. New wavelet transforms are also proposed which provide 
significant improvements over existing unidirectional transforms. 

Index Terms 

Data Compression, Wavelet Transforms, Wireless Sensor Networks 

EDICS Category: SEN-DIST 
I. Introduction 

In networks such as wireless sensor networks (WSNs), one major challenge is to gather data from a set 
of nodes and transfer it to a collection (or sink) node as efficiently as possible. Efficiency can be measured 
in terms of bandwidth utilization, energy consumption, etc. We refer to this as the data gathering problem. 
The gathering is typically done in data gathering rounds or epochs along a collection of routing paths 
to the sink, i.e., in every epoch each node forwards data that it has measured along a multi-hop path 
to the sink. A simple gathering strategy is to have each node route raw data to the sink in a way that 
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minimizes some cost metric, e.g., number of hops to the sink, energy consumption. This minimizes the 
amount of resources nodes use to transfer raw data to the sink and is the basis for many practical systems 
used in WSN such as the Collection Tree Protocol (CTP) [H. However, it has been recognized in the 
literature 111, 131 that, in a WSN, (i) spatial data correlation may exist across neighboring nodes and 
(ii) nodes that are not adjacent to each other in a routing path can still communicate due to broadcasted 
wireless transmission^^- Raw data forwarding does not make use of these two facts, thus, it will not be 
the most efficient data gathering method in general. 

When spatial data correlation exists, it may be useful to apply in-network compression distributed 
across the nodes to reduce this data redundancy 0- More specifically, nodes can exchange data with 
their neighbors in order to remove spatial data correlation. This will lead to a representation requiring 
fewer bits per measurement as compared to a raw data representation, also leading to reduced energy 
consumption, bandwidth usage, delay, etc. Since nodes in a WSN are severely energy-constrained [21, 
El, in, some form of in-network processing that removes data redundancy will help reduce the amount 
of energy nodes consume in transmitting data to the sink. In this way the lifetime of a WSN can be 
extended. This could also be useful in bandwidth-limited applications Q, HI. 

Generally speaking, distributed spatial compression schemes require some form of data exchange 
between nodes. Therefore, one needs to select both a routing strategy and a processing strategy. The 
routing strategy defines what data communications nodes need to make and the processing strategy defines 
how each node processes data. There are a variety of approaches available, e.g., distributed source coding 
(DSC) techniques S, (lOl, transform-based methods like Distributed KLT O, Ken Ull, FAQ Ull, and 
wavelet-based approaches IHl, HSl, Ull, HH, HH, IHl, HOl. Note that DSC techniques do not require 
nodes to exchange data in order to achieve compression. Instead, each node can compress its own data 
using some statistical correlation model. Note, however, that an estimate of these models must be known 
at every node, so nodes will still need to do some initial data exchange in order to learn the models (after 
which compression can be done independently at each node). Our work only considers transform-based 
methods, which use linear transforms to decorrelate data while distributing transform computations across 
different nodes. While we do not consider DSC approaches, our algorithms could be useful in the training 
phase of these methods to estimate correlation. Ken and PAQ are examples of approaches we consider, 
where data at each node is predicted using a linear combination of measurements from the node and 
measurements received from its neighbors. Similarly, the Distributed KLT, wavelet-based methods and 

'Data transmissions in a WSN are typically broadcast (4), (5), so multiple nodes can receive a single data transmission. 
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many other related methods also use linear transforms to decorrelate data. Therefore, we can restrict 
ourselves to linear in-network transforms while still encompassing a general class of techniques. 

Many of the existing transform-based methods propose a specific transform first, then design routing 
and processing strategies that allow the transform to be computed in the network. Some examples are the 
wavelet transforms proposed in |[T4ll . |[T5l . ifTSl . the Distributed KLT, Ken and PAQ. While these methods 
are good from a data decorrelation standpoint, the routing and processing strategies that are used to 
facilitate distributed processing may not always be efficient in terms of data transport cost. In particular, 
nodes may have to transmit their own data multiple times |[T4l . ITSll . nodes may need to transmit multiple 
copies of the same coefficients ifTSl . or nodes may even need to transmit data away from the sink ifTTI . 
|[T4l . lITSl . As discussed in |[T5l . this sort of strategy can outperform raw data gathering for very dense 
networks, but it can lead to significant communication overhead for small to medium sized ones. 

The results of our previous work |[20l . ||2T1 and of [151 demonstrate why transport costs cannot be 
ignored. One simple way to work around these issues is to first design an efficient routing tree (e.g., a 
shortest path routing tree, or SPT), then allow the transform computations to occur only along the routing 
paths in the tree. We call these types of schemes en-route in-network transforms. These transforms (e.g., 
the wavelet transforms in lfT6l . lUTl . lITSl . |[T9l . |[20l ) will typically be more efficient since they are 
computed as data is routed to the sink along efficient routing paths. In addition to overall efficiency, 
these transforms can be easily integrated on top of existing routing protocols, i.e., a routing tree can be 
given by a protocol, then the transform can be constructed along the tree. This allows such schemes to 
be easily usable in a WSN - as demonstrated by the SenZip Il22l compression tool, which includes an 
implementation of our algorithm in 1201 - as well as other types of data gathering networks Q, 0. 

We note that all existing en-route transforms start from well-known transforms, then modify them to 
work on routing trees. Instead, in this work we start from a routing tree T and additional links given by 
broadcast (e.g.. Fig. [T|). We then pose the following questions: (i) what is the full set of transforms that 
can be computed as data is routed towards the sink along T and (ii) what are conditions for invertibility 
of these transforms? The main goal of this work is to determine this general set of invertible, en-route 
in-network transforms. Note that in many transform-based compression systems, design or selection of a 
transform is considered separately from the design of a quantization and encoding strategy. This is done 
in practice in order to simplify the system design (e.g., |[23l ). In general certain properties of the transform 
(energy compaction, orthogonality) can serve as indicators of achievable performance in the lossy case. 
We adopt a similar approach in our work, choosing to only focus on the transform design. Simple 
quantization and encoding schemes can then be applied to the transform coefficients, as demonstrated in 
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our experimental results. Joint optimization of routing and compression is also possible, as in |[24l . ll25l 
and our previous work lIlTl . but this is beyond the scope of this work. 




After adding 
broadcast links 12^ 
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Forwarding Link 


> 


Broadcast Link 



Fig. 1. Example of routing tree and a tree augmented with broadcasts. Solid arrows denote forwarding links along the tree 
and dashed arrows denote broadcast links. 



In order to formulate this problem, we first note that the data gathering process consists of data 
measurement at each node and routing of data to the sink along T done in accordance with some 
transmission scheduling, i.e., nodes transmit data along T in a certain order. Also note that data is 
only transmitted along T in the direction of the sink, i.e., data transmissions are unidirectional towards 
the sink. Moreover, each node can only process its own data with data received from other nodes that 
transmit before it, i.e., processing of data must be causal in accordance with the transmission schedule. 
In particular, before each node transmits it will only have access to data received from nodes that use 
it as a relay in a multi-hop path to the sink (i.e., "descendants") and nodes whose data it receives but 
is not responsible for forwarding to the sink (i.e., "broadcast" neighbors). Whenever broadcast is used, 
data from a single node will often be available at multiple nodes. While this can help to decorrelate 
data even further (since more data will be available for transform computations at each node), it would 
be undesirable to transmit this same piece of data through multiple paths since this would increase the 
overall communication cost. Thus, in addition to causality and unidirectionality, the transform should 
also be critically sampled, i.e., the number of transform coefficients that are computed and routed to the 
sink is equal to the number of nodes in the network. We refer to causal, critically-sampled transforms 
that are computed in a unidirectional manner as unidirectional transforms. 

As we will show, unidirectional transforms can be defined in terms of the routing tree, the broadcast 
links induced by the routing and the transmission schedule. Thus, given a tree and transmission schedule, 
the main problem we address in this work is to determine a set of necessary and sufficient conditions 
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under which an arbitrary unidirectional transform is invertible. While unidirectional transforms have been 
proposed, to the best of our knowledge, none of the existing works have attempted to define the most 
general set of unidirectional transforms, nor has any attempt been made to find conditions under which 
such transforms are invertible. Our proposed theory also incorporates the use of broadcast data in a general 
setting. This leads us to develop transforms that use broadcasts in a manner not previously considered. 
This contribution is discussed in detail in Section |II1 

In the context of wavelet transforms for WSNs, early work ifTBl . ifTTIl . ifTSll . (T9\ developed unidirec- 
tional wavelet transforms on ID routing paths in WSNs. Extensions to 2D routing paths on arbitrary 
routing trees were made by the authors in |[20l . lIlTl . The superiority of these ID ifTSl and 2D ll20l 
transforms over the method in lITSl (which requires a great deal of backward communication) was 
demonstrated in |[20l . General unidirectional transforms were initially proposed by us in ||26l . in the 
context of lifting transforms ll27l . and conditions for single-level invertible unidirectional lifting transforms 
were initially proposed there. However, no invertibility conditions were provided for general unidirectional 
transforms, nor were any conditions given for invertible multi-level unidirectional lifting transforms. We 
provide such conditions here (Section JI] and IIII-CI ) as well as new transform designs (Section HVl ) which 
outperform our previously proposed transforms. 

General unidirectional transforms with a set of necessary and sufficient invertibility conditions are 
presented in Section Jll In order to demonstrate the generality of our proposed theory. Section JII] shows 
how existing unidirectional transforms (e.g., the tree -based KLT ll28l . tree-based differential pulse code 
modulation (T-DPCM) |[28l . |[22l and lifting transforms 1261 . ||28l ) can be mapped into our framework. 
Moreover, our proposed formalism is used to construct general unidirectional lifting transforms. Some of 
the inefficiencies of existing lifting transforms are then discussed. In order to address these inefficiencies, 
we define a new Haar-like wavelet transform in Section |IV] which is analogous to the standard Haar 
wavelet when applied to ID paths. As is shown in Section |IVl our formalization guarantees invertibility 
of these Haar-like transforms, and also leads to an extension which incorporates broadcast. Section |V] 
provides experimental results that demonstrate the benefits of using our proposed transforms. 

II. En-route In-network Transforms 

In this section, assuming a fixed routing tree T and schedule t{n) are given, we provide a definition 
of unidirectional transforms and determine conditions for their invertibility. Some notation is established 
in Section ITl-AI Unidirectional transforms are then defined in Section Hl-B I Section ITl-C I presents a set of 
conditions under which these transforms are invertible. Throughout this discussion, the configuration of 
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the network in terms routing and scheduling is assumed to be known. Section III-DI addresses how this can 
be achieved in practice and how our approach can be used with decentralized initialization approaches. 

A. Notation 

Assume there are N nodes in the network with a given routing tree T = (y,ET), where V = 
{1,2, . . . , N, N + 1}, each node is indexed by n G I = {1, 2, . . . , N}, the sink node is indexed by 

+ 1, and {m,n) € Et denotes an edge from node m to node n along T. We also assume that there 
is a graph G = {V,E) which is defined by the edges in Et and any additional edges that arise from the 
broadcast nature of wireless communications. An example graph is shown on the right side of Fig. [T] We 
observe that data gathering consists of three key components. The first is data measurement, where each 
node n measures some scalar data x{n) that it must send to the sink in each epoch (these ideas can be 
easily generalized to non-scalar dato- Additionally, node n must route its data to the sink along T. The 
tree T is defined by assigning to every node n a parent p{n). We assume that these trees are provided by a 
standard routing protocol such as CTP. Finally, we assume that data transmissions are scheduled llH, ||29l 
in some manner, i.e., node n will transmit data to its parent p{n) at time t{n) according to a transmission 
schedule (see Definition [B. CTP is a practical example that can be viewed in terms of this formalization: 
nodes are assigned parents in a distributed manner, data is forwarded to the sink along the corresponding 
routing paths and the times at which nodes transmit serve as an implicit transmission schedule. 

Definition 1 (Transmission Schedule): A transmission schedule is a function t : X ^ {1, 2, . . . , Mgiot), 
such that t{n) = j when node n transmits in the j-th time sloj^ Moreover, node n transmits data before 
node m whenever t{n) < t{m). 

Note that, along the tree T, each node has a set of descendants D„ which use node n as a data relay 
to the sink and a set of ancestors An that node n uses for relaying data to the sink. Moreover, we only 
consider to be descendants of n those nodes that are descendants on the tree and transmit earlier than n. 
Also let each node n be h{n) hops away from the sink node, i.e., n has depth h{n) in T. We also let 
denote the descendants of n which are exactly k hops away from n, i.e., = {m € T>n\p^{m) = n}, 
where p^{m) is the fc-th ancestor of node m (e.g., p^{m) is the parent of m, p'^{m) is the grandparent 

^One straightfoward extension is to use a "separable" transform, where a transform is first applied in one dimension (e.g., 
over time or across dimensions of a multivariate input) and then in the other (i.e., spatially). 

'Note that these time slots are not necessarily of equal length; they simply allow us to describe the order in which 
communications proceed in the network; before time slot t{n), node n is listening to other nodes, and at time t{n) node 
n starts transmitting its own data, and potentially data from its descendants in the routing tree. 
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of m, etc). For instance, C\ is the set of children of n, is the set of grandchildren of n, etc, and 
for simplicity we let Cn = C^. Also note that data can be heard via broadcast in many networks (e.g., 
WSNs), so we let bI define the fiill set of broadcast neighbors whose data node n can overhear due to 
broadcast. 

Under this formulation, each node n can process its own data x{n) together with data received from Vn 
and Bn- This yields transform coefficient y{n) for node n and transform coefficients for its descendants, 
i.e., y{m) for all m E P„. We make an abuse of notation by letting y{T>n) = {y{m)\m € T>n}- Note that 
node n is only responsible for forwarding y{n) and y{'Dn) to its parent p{n), thus, it should not transmit 
any data received from broadcast neighbors. In particular, we assume that node n transmits the transform 
coefficient vector y„ = [y{n) y{'Dn)]'' to its parent p(n) at time t{n). We refer to this as critical-sampling, 
where in each epoch only one transform coefficient per sample per node is generated and then transmitted 
to the sink. In our formulation, we also allow y{n) (and y{T>n)) to be further processed at the ancestors 
of n. We refer to this type of processing as delayed processing. 

Note that data is only transmitted along T towards the sink, i.e., data relay is unidirectional towards the 
sink. The existence of a transmission schedule - given explicitly or implicitly - also induces a notion of 
causality for transform computations. In particular, the computations performed at each node n can only 
involve x{n) and any received from a node m that transmits data before node n. More specifically, 
nodes can only use data from ra ^ Bn if t{m) < t{n) (we assume that t{m') < t(n) for all m! € T>n)- 
These constraints (i.e., causality and unidirectional relay) induce causal neighborhoods whose data each 
node n can use for processing, where we let Bn = {m G Bn\t{m) < t{n)} denote the set of causal 



broadcast neighbors. These can be abstracted as in Fig. |2] where yx>„ 



yc„(i) • • • yc„(ic„i) 



and 



• • • yB„(ie„i) 



. These are formally defined as follows. 




Fig. 2. Example of causal neighborhoods for each node. Node n receives y-D„ and yB„ from T>„ and Bn, respectively, 
processes x{n) together with yE>„ and yB„, then forwards its transform coefficient vector y„ through its ancestors in An- 
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Definition 2 (Causal Neighborhoods): Given a routing tree T and schedule t{n), the causal neigh- 
borhood of each node n is the union of the descendants P„ and the set of causal broadcast neighbors 
Bn = {m S Bl\t{m) < t{n)}, i.e., P„ U Bn- We also define Bn = Bn UmeB„ T^m for future discussions. 

These ideas are illustrated in Fig. |3] For instance, when node 2 forwards data to node 1, its commu- 
nication is also overheard by nodes 4 and 12. However, nodes 4 and 12 will not receive data from node 
2 before they transmit, thus, they cannot use it for processing. 




After removing 
forbidden links 





> 




t(12 


• 
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Sensor Node 




Fonwarding Link 


-> 


Broadcast Link 




Fig. 3. Illustration of causal neighborhoods. Node n transmits at time t(n). The left figure shows the full communication 
graph. The right figure shows the graph after removing broadcast links that violate causality and step by step decoding. 



B. Definition of Unidirectional Transforms 

We define a unidirectional transform (not necessarily invertible) as any transform that (i) is computed 
unidirectionally along a tree T and (ii) satisfies causality and critical sampling. Now we can establish the 
general algebraic form of unidirectional transforms. Without loss of generality, assume that node indices 
follow a pre-order numbering [30| on T, i.e., D„ = {n + 1, n + 2, . . . , n + jX'nl} for all n (see Fig. |3] 
for an example of pre-order numbering). A pre-order numbering always exists, and can be found via 



standard algorithms 11301 . For the sake of simplicity, we also assume that the transmission schedule t 
provides a unique time slot to each node^ i.e., t{n) ^ t{m) for all n ^ m. 

Recall that each node n receives yx)„ and y^,, from its descendants and (causal) broadcast neighbors, 
respectively (see Fig. |2l). Thus, in a general unidirectional transform, each node n processes its own 
data x{n) along with y©^ and yg^. Then, it will transmit transform coefficient vector y„ at time t{n). 

''We note that the time slot assignment need not be unique. However, this assumption significantly simplifies the transform 
construction and invertibility conditions. It is easy to develop similar transform constructions when multiple nodes are assigned 
the same time slots, and similar invertibility conditions arise. 
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We omit t{n) from the notation of y„ since the timing is impUcit. In order to satisfy critical-sampUng, 
it is necessary that each node only forward 1 + jP^] coefficients to the sink. Therefore, must be a 
(1 + |I^n|) X 1 dimensional vector. A unidirectional transform can now be expressed as follows. 

Definition 3 (Unidirectional Transform): Let T be a routing tree with a unique time slot assignment 
given by t{n), and suppose that the causal neighborhood of each node is given by Definition |2l A 
unidirectional transform on T is a collection of local transformations done at each node n given by 

x{n) 



-rt-n -D^ . . . JD„ 



(1) 



where y„ has dimension (1 + x 1, A„ has dimension (1 + l^nl) x (1 + jl'nl) and each B^ has 

dimension (1 + |I^n |) x (1 + The transform is computed starting from the node at the first time 

slot up through the nodes in the remaining time slots A; = 2, 3, . . . , A^. 



C. Invertibility Conditions for Unidirectional Transforms 

We now establish a set of invertibility conditions for unidirectional transforms. Note that these trans- 
forms are always computed in a particular order, e.g., starting from nodes furthest from the sink (i.e., 
"leaf" nodes), up to nodes which are 1-hop from the sink. Some sort of interleaved scheduling (where 
one set of nodes transmits before the rest) could also be used IIBTI . Therefore, it would also be desirable 
to have step by step decoding in the reverse order, since this would simplify the transform constructions. 
In particular, if the overall transform can be inverted by inverting the computations done at each node in 
the reverse order, then invertibility will be ensured by designing invertible transforms at each node. 

Step by step decoding in the reverse order is trivially guaranteed when no broadcast data is used since 
the transform at each node n is simply y„ = A„ • [x(n) y|, ]*. Thus, if each A„ is invertible, we can 
invert the operations done at node n as [x{n) yip ] * = (A„)~^ -yn. This becomes more complicated when 



broadcast data is used. By examining ([T|), we observe that y„ = A„ • \x{n) y^ ] * + 



B1 . . . B 



where yB„ = (i) • • • (|b |) • I^i order to have step by step decodability, we need to be able to 
recover (for every node n) x{n) and yx>„ from y„ and y^^ . Note that this fails whenever we cannot decode 
some transform coefficient vector y^ from broadcast node m £ Bn before decoding y„. It will also fail 
if the matrix operations performed at any given node are not invertible. Thus, in order to guarantee step 
by step decodability, we need to ensure that (i) the matrix operations at each node are invertible, and (ii) 
it is possible to decode each y^ before decoding y„. As we now show, (i) is guaranteed by ensuring 
that each A„ matrix is invertible and (ii) is guaranteed by imposing a timing condition. 
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Proposition 1 (Step by Step Decodability): Suppose that we have the transform in Definition [3] and 
assume that t{p{m)) > t{n) for every broadcast node m G Bn- Then we can recover x(n) and yD„ as 



• Yn - (Ar 



,-1 



Pile,. 

tin 



yg^ if and only if A„ ^ exists. 



Proof: Note that the vector transmitted by any broadcast node m G Bn will be processed at its 
parent, node p{m), and this processing will occur at time t{p{m)). Moreover, node n will generate its 
own transform coefficient vector y„ at time t{n), and by assumption we have that t{p{m)) > t{n). 
Thus, it is possible to decode y^ before y„ for every broadcast neighbor m € Bn- Thus, we can always 



form = 



yB„(i) • • • yB„(ie,.i) 



before decoding y„. Therefore, we can recover x{n) and yx>„ as 



• Yn 



,-1 



Bif" 



yB„ if and only if A„ ^ exists. 



To simplify our transform constructions, we also assume that nodes use the latest version of broadcast 
data that they receive, i.e., m e Bn only if Am n Bn = 0- This second constraint precludes the possibility 
that a node n receives broadcast data from node m and from an ancestor of node m. Removing the 
broadcast links which violate these constraints gives a simplified communication graph as shown on 
the right side of Fig. [3] Removal of these links can be done by local information exchange within the 
network; examples of how this can be achieved are discussed in Section III-DI Under the constraint of 
Prop. [U and this second constraint, we can represent the global transform taking place in the network as 
follows. Since the time slot assignment is unique, at time t{n) only data from n and its descendants will 
be modified, i.e., only x{n) and y{T>n) will be changed at time t{n). Since pre-order indexing is used, 
we have that yx)„ = [y{n + 1), . . . , y{n + Therefore, the global transform computations done 

at time t{n) are given by Q, where each y^ corresponds to data which is not processed at time t{n). 
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The global transform matrix Ci(„) at time t{n) is just the matrix shown in ([2]), i.e., 
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This yields the global transform coefficient vector 

y = Cat • Cat^i • • • Ci • x. 

Fig. m illustrates these transform computations. Initially, y = x = [x (1) x(2) . . . x(5)]*. At times 1 
and 2, nodes 3 and 5, respectively, transmit raw data to their parents. Therefore, the global matrices at 
times 1 and 2 are simply Ci = C2 = I. At time 3, node 4 produces 

x(3) 
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where and bi represent arbitrary values of the transform matrix used at node 4. Then at time 4, node 
2 produces transform coefficients y{2) and y(3) (and coefficient vector y2) as 
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where a[ and b[ are the values of the matrix used at node 2. Node 1 then computes yi at time 5. The 
global transform is given by 
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t(3) = 1 



t(2) = 4 




x(3) 



x(5) 



t(5)=2 



t(4) = 3 



Fig. 4. Example to illustrate unidirectional transform computations. Nodes generate and transmit transform coefficients in the 
order specified by the transmission schedule. 

It is now rather simple to show that the transform is invertible as long as each matrix A„ is invertible. 

Proposition 2 (Invertible Unidirectional Transforms): Suppose that we have the transform in Def. [3l 
the second timing constraint (m € Bn only if Am n Bn = 0) is met, and Prop. [T] is satisfied for every 
node n. Then the overall transform given by ^ is invertible. 

Proof: Under the two broadcast timing assumptions, the global transform is given by (01) is 
invertible if and only if every C4(„) in ([3]l is invertible. Ci(„) is invertible if and only if det (C((„)) 7^ 0. 
Recall that adding a multiple of one row to another does not change the determinant |[32l . Given the 
structure of the Ct(„) matrices, using such row operations to eliminate each matrix, it is easy to show 
that det (Cj(„)) = det(A„). Moreover, Prop. [T] implies that A„ is invertible. ■ 

Proposition |2] shows that locally invertible transforms provide globally invertible transforms. Moreover, 
under our stated timing constraints, broadcast data does not affect invertibility. Therefore, broadcast data 
at each node n can be used in an arbitrary manner without affecting invertibility. So in order to design 
an invertible unidirectional transform, all that one must do is design invertible matrices A„. This is an 
encouraging result since it essentially means that broadcast data can be used in any way a node chooses. 
In particular, broadcast data can always be used to achieve more data decorrelation. 

D. Discussion 

The theory presented thus far assumes that the routing and transmission scheduling are known, and 
that all of the transform matrices are known both at the nodes and at the sink. In practice, the routing, 
scheduling and transforms must be initialized. Moreover, the network may need to re-configure itself if, 
for example, nodes die or link conditions change drastically. In addition, packet losses will often occur. 
Nodes typically deal with this (as in CTP) by re-transmitting a packet until an acknowledgement (ACK) 
is received from the intended recipient. While these three issues pose no significant problems for routing. 
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they all have an impact on our proposed transform due to the assumptions we make about timing. We 
now provide some discussion of how this affects our theory and how it can be handled. 

We first address the impact that initialization and reconfiguration have on the routing and scheduling, 
as well as what can be done to address it. We assume that routing is initialized and reconfigured in a 
distributed manner using standard protocols such as CTP. Distributed scheduling protocols for WSNs also 
exist ||29l . ll33l . However, the resulting schedules may not be consistent with Definition |2] (i.e., they may 
not provide timings for which t(m) < t{n) for all m E so in practice we would need to enforce 
such timings. One way to achieve this is to force nodes to suppress transmission (in a given epoch) until 
they have received data from all of their descendants. Another alternative would be to determine such a 
transmission schedule at the sink, then to disseminate the timing information to the nodes. 

Whenever timing and routing information is established (or re-established due to re-configuration), it 
is also necessary to check our main broadcast timing constraint, i.e., m & Bn only if t{n) < t{p{m)). We 
describe one way in which this information can be disseminated to each node in a distributed manner. 
First, whenever the time t{n) at node n is initialized or changes, it broadcasts a small packet (i.e., a 
beacon) which contains t{n) to its children. Then, any child of n which broadcasts data will send the 
same beacon to all of its neighbors. This requires a total of 2 messages for each broadcasting node. 
Note that protocols such as CTP already use control beacons (in addition to data packets) to update 
stale routing information. Thus, nodes could potentially piggyback timing information on these control 
beacons whenever they are generated, or otherwise use separate control beacons to disseminate timing 
information. This will incur an additional cost, although (as was shown in 111) the per packet cost for 
control beacons is typically much smaller than the cost for data forwarding. 

Initialization and re-configuration also impacts the transform matrices that are used. Each node could 
transmit the values of its matrix to the sink, or viceversa, but this may be very costly. Instead, the 
construction of each transform matrix should be based on a small amount of information which is made 
common to the nodes and to the sink. For example, the values in each transform matrix could be based 
on the number of 1-hop neighbors that each node has |20] or the relative node positions |[T4l . In this 
way each matrix can be constructed at each node and at the sink without explicitly communicating the 
matrix values. However, additional information (e.g., node positions, number of neighbors) would need 
to be communicated to the sink whenever the network is initialized or re-configured. For example, each 
node could construct a transform using only the number of nodes that it receives data from (as in |20|, 
031) and would send the set of nodes whose data it used as overhead to the sink. Then, assuming that 
the nodes and the sink construct the matrices according to the same rules, the sink can re-construct the 
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matrix used at each node. 

Packet loss is the last practical issue which impacts our proposed transforms. We do not consider the 
effects of channel noise on the data since these can be handled using a wide variety of existing techniques. 
Moreover, packet losses and channel noise will impact other data gathering schemes (e.g., CTP), and we 
expect that the penalty due to packet losses will be similar in our scheme and in other data gathering 
schemes. Packet losses are typically handled (as in CTP) by re-transmitting a packet until an ACK is 
received from the desired destination. Thus, if node n does not receive data from descendant n + k hy 
the time that it transmits, due to packet re-transmissions for n -\- k, the data from node n + k cannot be 
combined with data available at node n. This is equivalent to not using the data from node n + k in the 
transform computation (i.e., A„(j, k + 1) = A„(A; + 1, j) = for all j 7^ + 1 and A„(/c + 1, A: + 1) = 1) 
and does not affect our proposed theory. However, this change must be signaled to the sink so that it 
knows how to adjust A„ accordingly. This can be done by including some additional information in the 
packet headers for node n and n + k to signify this change. 

Packet losses also have an impact on the use of broadcast data. Suppose that node n does not receive 
a data packet from broadcast neighbor but the packet from bk does reach the intended recipient p{bk)- 
In this case, node p{bk) will send an ACK back to node bk and node hk will no longer re-transmit (note 
that node will not expect an ACK from node n). Thus, data from node bk can not be combined with 
data available at node n. This is equivalent to not using data from node bk in the transform computation 
(i.e., = 0) and our proposed theory is not affected. However, this change must be signaled to the 
sink so that it knows to set BJ^ = 0. 

One way to work around these issues (initialization, re-configuration and packet losses) is to design 
transforms that can work under arbitrary timing and with arbitrary uses of broadcast data. However, 
under arbitrary timing and use of broadcast data, it is no longer possible to guarantee global transform 
invertibility by designing invertible transforms at each node. More specifically, we must ensure that the 
transform computations done at different nodes are jointly invertible. This leads to a set of complex 
conditions. The cost to determine such conditions and to coordinate nodes so that they satisfy these 
conditions could be very high, perhaps even much higher than the additional coordination needed to 
implement our proposed transforms. However, it is still possible to design simple versions of such 
transforms by using constructions such as lifting. Our recent work ||3T| is one particular example. Given 
this high degree of complexity to ensure an invertible transform when using broadcast, broadcast data 
should probably only be used with our proposed transforms if (i) it is possible to fix the timing in the 
network in accordance with the Definition |2j and, (ii) the timing is very stable. 
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III. Unidirectional Transform Designs 

Proposition |2] provides simple conditions for invertible transform design, i.e., A„ is invertible for every 
node n. This is a simple design constraint that unifies many existing unidirectional transforms. In this 
section, we demonstrate how existing unidirectional transforms can be mapped to our formulation. In 
particular, we focus on the tree-based Karhunen-Loeve Transform (T-KLT) |[28]| . T-DPCM ll22l . |[28i l and 
early forms of tree-based wavelet transforms |[T6l . lITTl . |[T8l . jlOll constructed using lifting ITTl . 

In order to exploit spatial correlation to achieve reduction in the number of bits per measurement, nodes 
must first exchange data. Therefore, some nodes must transmit raw data to their neighbors before any form 
of spatial compression can be performed. Since raw data typically requires many more bits than encoded 
transform coefficients, it would be desirable to minimize the number of raw data transmissions that nodes 
must make to facilitate distributed transform computation. Therefore, our main design consideration is 
to minimize the number of raw data transmissions that are required to compute the transform. 



A. Tree-based Karhunen-Loeve Transform 

Since transforms that achieve data decorrelation potentially lead to better coding efficiency |[34]| . 

we consider now the design of unidirectional transforms that achieve the maximum amount of data 

decorrelation. This can be achieved by applying, at each node n, a transform A„ that makes all of the 

coefficients in y„ statistically uncorrelated (or "whitened"), e.g., by using a Karhunen-Loeve transform 

(KLT) at each node, leading to the T-KLT described in our previous work |[28ll . In this transform, each node 

n computes and transmits a set of "whitened" coefficients y„, which will then have to be "unwhitened" 

and then re-whitened at p{n) to produce a new set of whitened coefficients. Whitening can be done using a 

KLT and unwhitening can be achieved using an inverse KLT. More specifically, this is done at each node n 

by (i) finding the whitening transform H„ and unwhitening transforms of each child Gc„ (i) , (ii) applying 

an unwhitening transform to each child to recover the original measurements as = Gc^(i) • yc„(i)' 

1 1 



and then (iii) rewhitening these measurements as y„ = 
(without quantization) can then be expressed in terms of our formulation as 

1 ••• 
Gc„(i) ••• 



"C„{|C„|) 



This transform 











'C„(|C„|) 





x{n) 




yc„(i) 




_ yc„(|c„i) 



(6) 



with A„ = H„ -diag (l, Gc„(-i), . . . , Gc^[\c„\))- Each A„ is trivially invertible since H„ and each Gc^{i) 
are invertible by construction. Therefore, the tree-based KLT is trivially invertible. 
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B. Tree-based DPCM 



A simpler alternative to the T-KLT is T-DPCM ||22l, ||28l. A related DPCM based method was proposed 
in |[35l . This particular method is not designed for any particular communication structure, but it can 
easily be adapted to take the form of a unidirectional transform. In contrast to the method in [35], the 
T-DPCM methods in ll22l . ll28l compute differentials directly on a tree such as an SPT 

In the T-DPCM method of ll28l . each node n computes its difference with respect to a weighted average 
of its children's data, i.e., y{n) = x{n) — J2m.eC ^n{'m)x{m). For this to be possible, one of two things 
must happen: either every node n must decode the differentials received from its children to recover 
x{m) for each m ^ Cn, or, every node n must transmit raw data two hops forward to its grandparent (at 
which point y{n) can be computed) to avoid decoding data at every node. In order to avoid each node 
having to forward raw data two hops, at each node n, the inverse transform on the data of each child 
Cn{i) must be computed first using the inverse matrix (Ac,^(j)) ^ of each child. The forward transform 
is then designed accordingly. We can express this version of T-DPCM as in ([7]). 



1 





-a„(D„) 
I 



1 




,-1 



^C„{|C„|)J 





, -1 





x{n) 




yc4i) 




_ yc„(ic„i) 



(V) 



••• I 

The matrix A„ is just the product of these triangular matrices, hence, it is trivially invertible. Moreover, 
only leaf nodes need to forward raw data and the rest transmit only transform coefficients. 

Alternatively, in the T-DPCM scheme of |22|, each node n first forwards raw data x{n) to its parent 
p{n), then node p(n) computes a differential for n and forwards it to the sink, i.e., node p(n) computes 
y{n) = x{n) — a„(p(n))x(p(n)). This transform can also be mapped to our formalism as 

1 x{n) 
-ax>„ (n) I yx>„ 

This eliminates the computational complexity of the previous T-DPCM method since no decoding of 
children data is required. However, every node must now forward raw data one hop. Moreover, it will 
not decorrelate the data as well as the first method since only data from one neighbor is used. 



Yn 



(8) 



C. Unidirectional Lifting-based Wavelets 

We now describe how unidirectional wavelet transforms can be constructed under our framework. This 
can be done using lifting lIZTll . Lifting transforms are constructed by splitting nodes into disjoint sets of 
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even and odd nodes, by designing prediction filters, which alter odd data using even data, and update 
filters, which alter even data based on odd data. They are invertible by construction |[27l . 

First, nodes are split into odd and even sets O and £, respectively. This can be done completely 
arbitrarily. One example from our previous work |[20l is to split according to the depth in the tree, e.g., 
O = {n : h{n) mod 2 = 1} and £ = {m : h{m) mod 2 = 0}, as illustrated in Fig. |5] Data at each 
odd node n € O is then predicted using data from even neighbors J\fn C £, yielding detail coefficient 

d{n) = x{n) - ^ pnii)x{i). (9) 

The prediction vector p„ can provide a simple average ll20l . i.e., Pn{i) = for each i € A/"„, a planar 
prediction |[T5l of the data at node n using data from its neighbors, or can even be data adaptive |28|. 
Incorporating some broadcast data into the prediction is also useful since it allows odd nodes to achieve 
even further decorrelation. After the prediction step, data at each even node m € £" is updated using 
details from odd neighbors Mm C O, yielding smooth coefficient 

s{m) = x{m) + ^ Um{j)d{j). (10) 

The update vector can provide simple smoothing ||20l, i.e., u^O) = 2.\j\f J ^ -^m, or could 

provide orthogonality between smooth (i.e., low-pass) and detail (i.e., high-pass), coefficients [361. 




Fig. 5. Example of splitting based on the depth of the routing tree. White (odd depth) nodes are odd, gray (even depth) nodes 
are even and the black center node is the sink. 

By the lifting construction, invertibility will be guaranteed as long as (i) odd node data is only predicted 
using even node data, and (ii) even node data is only updated using details from odd nodes. So if £ and 



January 27, 2010 



DRAFT 



18 



O is an arbitrary even and odd split, the transform computed at each node will be invertible as long as 
the computations satisfy (i) and (ii). More formally, let On = (n U Vn) n O be the set of odd nodes 
whose data is available at n from its subtree. Let £n = {nU P„) n £^ be defined similarly. Moreover, let 
of = Bnf^O denote the set of odd nodes whose data n receives via broadcast. Similarly, let £^ = BriHS. 
Then the computations at n will be invertible as long as it only predicts y{On) from y{£n) and y{£n) 
and only updates y{£n) from y{On) and y{0^). Let M„ and be permutation matrices such that 

^ M„ 




y{£n) 

Then n can compute transform coefficients as in (fT2l 



x{n) 

y{Bn) 



(11) 
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u„ I 



I P„ 
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) together, we 
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p„ 


_ U„ I _ 







I 



y[On) 
y{£n) 
y{o^) 
y{£^n) 

■ YbJ', with 



(12) 



A„ = (M„)* 

_ U„ 

B„ = (M„)*- ^ " -Mf 

Since det(A„) = 1, single-level unidirectional lifting transforms are always invertible. 

The transform given by ([T2l) corresponds to only one level of decomposition. In particular, at each 
node n the transform of (fT2l) will yield a set of smooth (or low-pass) coefficients {y{k)}ke£,^ and a set of 
detail (or high-pass) coefficients {y(0}ieC'„- The high-pass coefficients will typically have low energy if 
the original data is smooth, so these can be encoded using very few bits and forwarded to the sink without 
any further processing. However, there will still be some correlation between low-pass coefficients. It 
would therefore be useful to apply additional levels of transform to the low-pass coefficients at node 
n to achieve more decorrelation. This will reduce the number of bits needed to encode these low-pass 
coefficients, and will ultimately reduce the number of bits each node must transmit to the sink. 

Suppose each node performs an additional J levels of lifting transform on the low-pass coefficients 
{y{k)}ke£„- At each level j = 2, 3, . . . , J + 1, suppose that nodes in <Sit^ are split into even and odd 
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sets £n and On, respectively. We assume that ^"^ = £n- For each odd node / G On, we predict y{l) using 
even coefficients from some set of even neighbors A// C £n, i.e., y{l) = y{l) — J2keAfi^ Vi,j{^)y{^)- 
Then for each even node k € £n, we update y{k) using odd coefficients from some set of odd neighbors 
Ml C On, i.e., y{k) = y{k) + X^^g^^^j Ukj{l)y{l). This decomposition is done starting from level j = 2 



up to level j = J + 1. For all j = 2, 3, 



, J + 1, let M:^ be a permutation matrix such that 

yioi) 

y{£L) = ■ Yr. 
y{ni) 



(13) 



where IZn = (n U T>n) — {On U £n) is the set of nodes whose coefficients are not modified at level j. 
Then we can express the level j transform computations in matrix form as 



(14) 



where 'Pn and 15 n represent the prediction and update operations used at level j, respectively. 

By combining ([TT]), Gill, GSJl and (O, we finally get that y„ = [A„ B„] • \x{n) y|, ]*, with A„ 
and B„ defined in ( fTSl ) and ( fT6l ). 
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(16) 



Prop. [2] implies that the overall transform is invertible if A„ given in ( fTSl ) is invertible. Since each M:^ 
is a permutation matrix, |det(M:^)| = 1. Moreover, the remaining matrices are triangular. Thus, it easily 
follows that det(A„) = 1. Therefore, unidirectional, multi-level lifting transforms are always invertible. 



D. Unidirectional 5/3-like Wavelets 

Our previous work II20I provides a 5/3-like transform on a tree. First, nodes are split into odd and even 
sets O and £, respectively, by assigning nodes of odd depth as odd and nodes of even depth as even. More 
specifically, O = {n : h{n) mod 2 = 1} and £ = {m : h{m) mod 2 = 0}. This is illustrated in Fig. [5] 
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The transform neighbors of each node are simply Mn = {pin)} U C„ for every node n. This provides 
a 5/3-like wavelet transform on a tree since whenever averaging predictions and smoothing updates are 
used along a ID path, the transform reduces to the 5/3 wavelet transform 137]. Nodes can compute these 
transforms in a unidirectional manner, but doing so requires that some nodes forward raw data 1 or 2 
hops. This is illustrated in Fig. [6] 



y7 = M7ll 




|y5 = [x(5ii CBj = 
y4 = M4)i (3) / 



\ 



y3 = [x(3)x(4)x(6)l' 




ye = [x(6)x(7)x(8 



Nodes 4, 5, 7, 8 tx raw data 



Nodes 3, 6 tx raw data 



Fig. 6. Raw data transmissions for 5/3-like transform. Nodes 3 and 6 need x{2) to compute details d(3) and d(6), so they 
must forward raw data over 1-hop to node 2. Nodes 4 and 5 need d(3) to compute s(4) and s(5), so they must forward raw 
data over 2-hops. 



Data from each odd node n is predicted using data x(C„) (from children C„) and x{p{n)) (from 
parent p{n)). However, odd node n will not have x{p{n)) locally available for processing. Therefore, 
we require that each odd node n transmit raw data x{n) one hop forward to its parent p{n), at which 
point node p{n) can compute the detail coefficient of n. Each even node m will then compute detail 
d{j) = x{j) — X^iec PKi)^(i) ~ Pj(m)x(m) for every child j € Cm- Similarly, the smooth coefficient 
of each even node m requires details from its parent p{m) and children Cm, so it can not be locally 
computed either. Moreover, detail d{p{m)) can only be computed at node p'^{m), i.e., at the grandparent 
of m. Therefore, we require that even node m transmit raw data x{m) two hops forward to p'^{m), at 
which point d{p{m)) will be available and p^{m) can compute s{m) = x{'m)+'}2j(zi^p{m)}ijc ^rn{j)d{j). 
Note that each of these operations are trivially invertible, and easily lead to local transform matrices A„ 
which are invertible by construction. However, the number of raw data transmissions is relatively high, 
i.e., 1-hop for odd nodes and 2-hops for even nodes. We address this inefficiency in the next section. 
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IV. Unidirectional Haar-like Wavelets 

For the transform in Section IIII-DI raw data from even and odd nodes must be forwarded over 2-hops 
and 1-hop, respectively. This can be inefficient in terms of transport costs. Instead, it would be better to 
construct a lifting transform which directly minimizes the number of raw data transmissions each node 
must make. We use the splitting method in Section IIII-DI Note that some form of data exchange must 
occur before the transform can be computed, i.e., evens must transmit raw data to odds, or viceversa. 
Suppose that even nodes forward raw data to their parents. In this case, the best we can do is to design 
a transform for which even nodes transmit raw data over only 1-hop, and odd nodes do not transmit any 
raw data. This will minimize the number of raw data transmissions that nodes need to make, leading to 
transforms which are more efficient than the 5/3-like transform in terms of transport costs. We note that 
minimizing raw data only serves as a simple proxy for the optimization. A more formal optimization 
which relies on this same intuition is undertaken in our recent work f3T\ . 

A. Transform Construction 

A design that is more efficient than the 5/3-like transform can be achieved as follows. Note that an odd 
node n has data from its children C„ and/or even broadcast neighbors BnCiS locally available, so it can 
directly compute a detail coefficient for itself, i.e., (i(n) = x{n) — J2^^f^ Pn{i)x{'>') — J2jeB nf P»^(j)^(^)- 
Thus, the detail d{n) is computed directly at n, is encoded, and then is transmitted to the sink. These 
details require fewer bits for encoding than raw data, hence, this reduces the number of bits that odd 
nodes must transmit for their own data. Since data from even node m is only used to predict data at its 
parent p{m), we simply have that Mm = {p{m)} and s{m) = x{m) + u„i{p{m))d{p{m)). Moreover, 
these smooth coefficients can be computed at each odd node n. Therefore, even nodes only need to 
forward raw data over one hop, after which their smooth coefficients can be computed. Note that not all 
odd nodes will have children or even broadcast neighbors, i.e., there may exist some odd nodes n such 
that C„ = and H £" = 0. Such odd nodes can simply forward raw data x{n) to their parent p{n), 
then p(n) can compute their details as d{n) = x{n) — pn{p{n))x{p{n)). Thus, there may be a few odd 
nodes that must send raw data forward one hop. This leads to a Haar-like transform which is exactly 
the Haar wavelet transform when applied to ID paths. 

Odd nodes can also perform additional levels of decomposition on the smooth coefficients of their 
descendants. In particular, every odd node n will locally compute the smooth coefficients of its children. 
Therefore, it can organize the smooth coefficients {s(A;)}fcgc„ onto another tree and perform more 
levels of transform decomposition along T^. In this work, we assume is a minimum spanning tree. This 
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produces detail coefficients {d2{k)}keoi^ {c^3(^)}fceC'3 , • • {dj+i{k)}j^^Qj+i and smootli coefficients 
{sj+i(fc)}^g£-j+i for some J > 0. In tliis way, odd nodes can further decorrelate the data of their children 
before they even transmit. This reduces the resources they consume in transmitting data. An example of 
this separable transform for J = 1 is illustrated in Fig. |7J By choosing averaging prediction filters and 
the orthogonalizing update filter design in |[36l . we get the global equation in iT7\ . 
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(17) 



The coefficient vector ye is obtained in a similar manner. More generally, these sorts of multi-level 
transform computations can always be formulated into matrices as described in Section IIII-CI 




d(3) = x(3) - [x(4) + x(5)] / 2 
s(4) = x(4) + d(3) / 3 
s(5) = x(5) + d(3) / 3 



d(6) = x(6) - [x(7) + x(8)] / 2 
s(7) = x(7) + d(6) / 3 
s(8) = x(8) + d(6) / 3 



ya = [dO) d2(4) S2(5)l' 



d2(4) = s(4) - s(5) 
S2{5) = s(5) + d2{4) / 2 




Ve = [d(6) d2(7) S2(8)]' 



d2(7) = s{7) - s(8) 
S2(8) = s(8) + d2(7) / 2 



(a) 1" -level along T 



(b) 2"''-level "orthogonal" to T 



Fig. 7. Unidirectional Computations for Haar-like Transform. In (a), nodes 3 and 6 compute a first level of transform. Then 
in (b), nodes 3 and 6 compute a second level of transform on smooth coefficients of their children. 



B. Discussion 

The transform computations that each node performs can be easily mapped into our standard form 
Yn = [An B„] • [x(n) yl, ]* appropriately populating the matrices in ( [15] ) and ([T6l ). Therefore, 
they will always yield invertible transforms. For example, since each odd node n predicts its own data 
x(n) using data from its children C„ and even broadcast neighbors Bn n £, then updates the data of its 
children from its own detail, the operations for a single level transform at odd n can be expressed as 

x(n) 
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ux),^ (n) I 



1 -p„(P„) -pn{Bn) 
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(18) 
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By choosing 
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1 -Pn{Bn 

u-p,^ (n) I I 

we have that y„ = [A„ B„] • [x{n) ]*. Note that ([TSl l covers all of the cases discussed in 

Section lEAl for each odd node n, that is to say: (i) C„ 7^ and n £^ / 0, (ii) C„ = and fi„ n ^ / 0, 
(iii) Cn / and ,B„n£' = 0, and (iv) C„ = and S„n£^ = 0. In particular, whenever Cn 7^ 0, Pn(^^n) and 
ui)„(n) will have some non-zero entries. Otherwise, n has no descendants and so p„(D„) and ujj^X^) 
will just be vectors of zeros. Similarly, whenever S„ n £^ 7^ 0, p„(i3„) will have some non-zero entries. 
Otherwise, n has no even broadcast neighbors and Pn{Bn) will be a vector of zeros. 

Similarly, each even node m may need to compute predictions for its odd children, so its computations 
for a single level transform can be expressed as 

1 
-Po,„ (m) I 

Also note that (|2TI ) covers all of the cases for each even node m discussed in Section IIV-AI i.e., when 
m has to compute predictions for children then px),„ (m) 7^ 0, otherwise, px),„ (m) = 0. 

Note that, when broadcast data is used, the decorrelation achieved at odd nodes may still be comparable 
to the 5/3-lLke transform since the same number of neighbors (or more) will be used. Moreover, broadcasts 
are particularly useful for odd nodes n that have no children, i.e., n for which Cn = ^ but Bn £ ^ 
If broadcast data is not used when it is available, node n will have to transmit x{n) to its parent. Since 
x{n) requires more bits for encoding than does a detail coefficient d{n), n will consume more resources 
during data transmission. By using broadcasts, these odd nodes which have no children can still use data 
overheard from even broadcast neighbors, allowing them to avoid transmitting raw data to their parents. 
This is illustrated in Fig. [H where node 1 1 has no children but overhears data from node 12. The example 
in Fig. |8(a)| will consume more resources at node 1 1 than will the example in Fig. |8(b)[ 





x(m) 







V. Experimental Results 

This section presents experimental results that compare the transforms proposed here against existing 
methods. Source code used to generate these results can be found on our webpageO. In particular, we 

^http://biron.usc.edu/wiki/index.phpAVavelets_on_Trees 
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(a) Without Broadcasts (b) With Broadcasts 

Fig. 8. No broadcasts are used in (a), so node 11 consumes more resources when transmitting raw data a;(ll). Broadcasts are 
used in (b), so node 11 consumes less resources when transmitting detail ci(ll). 

focus on comparing the proposed multi-level Haar-like lifting transforms against the multi-level 5/3- 
like transform from |[20l . |[26l . the T-DPCM scheme in ll28l and raw data gathering. We consider the 
application of distributed data gathering in WSNs. Performance is measured by total energy consumption. 

A. Experimental Setup 

For evaluation, we consider simulated data generated from a second order AR model. This data consists 
of two 600 X 600 2D processes generated by a second order AR model with low and high spatial data 
correlation, e.g., nodes that are a certain distance away have higher inter-node correlation for the high 
correlation data than for the low correlation data. More specifically, we use the second order AR filter 
H{z) = (i_peJ^Q^-i)(\-pe-j"o^-i) ' with p = 0.99 and ujq = 99 (resp. wq = 359) to produce data with 
low (resp. high) spatial correlation. The nodes were placed in a 600 x 600 grid, with node measurements 
corresponding to the data value from the associated position in the grid. Each network used in our 
simulations is generated from a set of random node positions distributed in the 600 x 600 grid. An SPT 
is constructed for each set of node positions. We consider two types of networks: (i) variable radio range 
networks in which each node can have a diferent radio range, and {ii) fixed radio range networks in which 
each node has the same radio range. In the variable radio range case, the radio range that each node n 
uses for transmission is defined by the distance from n to its parent in the SPT. Additional broadcast 
links induced by the SPT are also included, i.e., a broadcast link between node n and m exists if m is 
not a direct neighbor of n in the SPT but is within radio range of n. 

In order to measure energy consumption, we use the cost model for WSN devices proposed in 161, |[38l . 
where the energy consumed in transmitting k bits over a distance D is ET{k,D) = Eeiec - k + Eamp ■ k ■ D'^ 
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Joules and the energy consumed in receiving k bits is Eji{k) = E^iec • k Joules. The E^iec • k terms 
capture the energy dissipated by the radio electronics to process k bits. The Samp • k ■ term captures the 
additional energy for signal amplification needed to ensure reasonable signal power at the receiver. WSN 
devices also consume energy when performing computations, but these costs are typically very small 
compared with transmission and reception costs. Therefore, we ignore them in our cost computations. 
Also note that all data gathering schemes will suffer from channel noise and attenuation, so a no-channel- 
loss comparison is still valid. Thus, we do not consider these effects in our experiments. 

Comparisons are made with the Haar-like transforms of Section |IV] against the 5/3-like transform with 
delayed processing proposed in |[26l and the T-DPCM scheme proposed in |[28l . Predictions for each of 
these transforms are made using the adaptive prediction filter design in ll28l . Updates are made using the 
"orthogonalizing" update filter design in ||36l . In each epoch, we assume that each node transmits M = 50 
measurements taken at M different times. Also, each raw measurement is represented using Br = 12 
bits. We assume each odd node encodes M detail coefficients together with an adaptive arithmetic coder. 
Smooth coefficients are treated like raw data, i.e., each one uses Br bits. Since we only seek to compare 
the performance of spatial transforms, we do not consider any temporal processing. 



B. Simulation Results 

In the case of lossless compression, the average cost reduction ratios taken over multiple uniformly 
distributed networks are shown in Fig. |9] for high and low data correlation. These are expressed as the 
average of multiple values of {Cr — Ct)/Cr, where Ct is the cost for joint routing and transform and Cr 
is the cost for raw data forwarding. Results for variable radio ranges (each node has different radio range) 



are shown in Fig. |9(a)[ Results for fixed radio ranges (each node has the same radio range) are given in 
Fig. |9(b)| T-DPCM does the worst overall. The 5/3-like transform provides significant improvement over 
the simple T-DPCM scheme. The Haar-like transforms have the highest average cost reduction ratio, or 
equivalently, the lowest average cost. Moreover, we note that broadcast is not very helpful (on average) 



when nodes have variable radio ranges (Fig. 9(a) i, but there is a significant gain when nodes use a fixed 
radio range (Fig. |9(b)[ ). This is mainly because, in the fixed radio range case, (i) there are many more 
opportunities for using broadcast data and (ii) each node has more broadcast neighbors. 

Note that the amount of raw data forwarding needed to compute the Haar-like transform is significantly 
reduced compared with the 5/3-like transform. Therefore, the Haar-like transform will do better than the 
5/3-like transform in terms of transport costs. Granted, the 5/3-like transform will use data from more 
neighbors for processing, so the decorrelation given by the 5/3-like transform will be greater than that 
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Average Percent Cost Reduction Average Percent Cost Reduction 

a65r 0.5r 




0.2 1 < < < < ' 0.1 1 < < < < ' 

100 200 300 400 500 100 200 300 400 500 

No. of Sensors No. of Sensors 



(a) Variable Radio Range (b) Fixed Radio Range 

Fig. 9. Average percent cost reduction ( ). Solid and dashed lines correspond to high and low spatial data correlation, 
respectively. Best performance achieved by Haar-like transforms, followed by 5/3-lik;e transform and T-DPCM. High coiTelation 
data also gives greater cost reduction than low correlation data. 



given by the Haar-like transform. However, in our experiments the average reduction in rate that the 
5/3-like transform provides over the Haar-Uke transform is rather small. The Haar-like transform with 
broadcast also provides additional cost reduction over the Haar-like transform without broadcasts since 
less raw data forwarding is needed on average. Moreover, the amount of cost reduction achievable is 
higher for the high correlation data than for the low correlation data. 

Lossy coding is also possible and can provide even greater cost reductions while introducing some 
reconstruction error. In this case, we quantize transform coefficients with a dead-zone uniform scalar 
quantizer. Performance is measured by the trade-off between total cost and distortion in the reconstructed 
data, which we express as the signal to quantization noise ratio (SNR). Sample 50 node networks are 
shown in Figs. 10(a) and |10(c)| and, in the case of high correlation data, the corresponding performance 
curves are shown in Figs. |10(b)| and |10(d)[ The Haar-like transforms do the best among all transforms. 

When using broadcasts with the Haar-like transform, there is an additional 1 dB (resp. 2.5 dB) gain in 
SNR for the variable (resp. fixed) radio range network at a fixed cost, i.e., by using broadcasts we can 
increase the quality in the reconstructed data for a fixed communication cost. Thus, for these networks, 
using broadcast is quite helpful. Also note that there only 2 broadcast links used in the transform for 



the variable radio range network (Fig. |10(a)| ), whereas there are over 10 broadcast links used in the fixed 
radio range network (Fig. |10(c)| ). Thus, broadcast provides even greater gains for the fixed radio range 
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Transform Structure on SPT (Variable RR) SNR vs. Energy Consumption (Variabie RR) 




Totai Energy Consumption (Jouies) 



(a) Variable RR Network (b) Cost-Distortion Curves (Variable RR) 

Transform Structure on SPT (Fixed RR) SNR vs. Energy Consumption (Fixed RR) 




too 200 300 400 500 600 0.02 0.04 0.06 0.08 0.1 0.12 0.14 0.16 

Totai Energy Consumption (Jouies) 



(c) Fixed RR Network (d) Cost-Distortion Curves (Fixed RR) 

Fig. 10. Sample networks with corresponding Cost-Distortion curves. In (a) and (c), blue lines denote forwarding links, dashed 
magenta lines are broadcast links, green circles are even nodes, red x's are odd nodes, and the black center node is the sink. 



network (2.5 dB versus IdB) since there are more broadcast links. More generally, broadcast should 
provide more gains in networks where many broadcast opportunities are available. 

Also note that in this particular network for the variable radio range case, T-DPCM actually does 
better than the 5/3-like transform. Note that in T-DPCM, only the leaf nodes forward raw data to the 
sink; so if there are only a few leaf nodes, the raw data forwarding cost for T-DPCM may not be very 
high compared with the raw data forwarding cost for the 5/3-like transform. In this particular network, 
only 19 of the 50 nodes are leaves in the tree. Therefore, the raw data forwarding cost for T-DPCM in 
this case is lower than that for the 5/3-like transform. However, on average the raw data forwarding cost 
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for T-DPCM will be very high (see Fig. |9l), leading to higher total cost on average as compared with the 
5/3 -like transform. 

VI. Conclusions 

A general class of en-route in-network (or unidirectional) transforms has been proposed along with 
a set of conditions for their invertibility. This covers a wide range of existing unidirectional transforms 
and has also led to new transform designs which outperform the existing transforms in the context 
of data gathering in wireless sensor networks. In particular, we have used the proposed framework to 
provide a general class of invertible unidirectional wavelet transforms constructed using lifting. These 
general wavelet transforms can also take into account broadcast data without affecting invertibility. A 
unidirectional Haar-like transform was also proposed which significantly reduces the amount of raw data 
transmissions that nodes need to make. Since raw data requires many more bits than encoded transform 
coefficients, this leads to a significant reduction in the total cost. Moreover, our proposed framework 
allows us to easily incorporate broadcasts into the Haar-like transforms without affecting invertibility. 
This use of broadcast data provides further performance improvements for certain networks. 
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